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1. Introduction and statement of results 

Let Q be a smooth hypersurface in R 3 and let go be a point on Q. Let da(q) 
denote the standard surface measure on Q. For a small cutoff function <fi supported near 
go, we define the maximal operator M, initially on Schwarz functions, by 

M/(x)=sup| / f(x-tq)<j)(q)da(q)\ (1.1) 

t>0 JQ 

Our goal is to determine for which p is the maximal operator M bounded on L p . Note that 
by subadditivity of maximal operators of the form (1.1), one can prove L p boundedness of 
the nonlocalized analogue of (1.1) over a compact surface by doing a partition of unity and 
reducing to (1.1). If L is an invertible linear transformation and Ml denotes the maximal 
operator corresponding to the surface L(Q), then one can easily check from the definitions 
that Mi,f(x) — |<iet(L)|M(/oL)(L _1 x). Hence when studying L p boundedness properties 
one may always replace M by Ml,. In particular, we may assume that near go, Q is given 
as the graph of a function g(x,y) such that if (xo,yo) denotes the point in the x-y plane 
which g lies above, then Vg(xo,yo) = 0. 

The earliest work in this area was done in the case where Q is an n-dimensional 
sphere in R 7t+1 , when Stein [Stl] showed M is bounded on L p iff p > for n > 1. This 
was later generalized by Greenleaf [Gr] to surfaces of nonvanishing Gaussian curvature, 
with some further results for when the Hessian has rank between 1 and n. The n = 1 case 
was later proven by Bourgain [B]. In [So] Sogge showed in any dimension that whenever 
Q has at least one nonvanishing principal curvature, M is bounded on L p for all p > 2. 
The case of convex surfaces of finite line type has been extensively analyzed; we refer to 
[IoSe2] and [NaSeWa] for more information on these situations. 

Although there are many interesting issues when p < 2, for the purposes of this 
paper we always assume p > 2. M is trivially bounded on L°°, and if M is bounded on 
some L p , by interpolating with the L°° case one has that M is bounded on L p for p' > p. 
Hence our goal is to determine the optimal po > 2 for which M is bounded on L p for 
p > Pq. If Q is tangent to the tangent plane T qo (Q) to infinite order at go, then as long as 
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T qo (Q) a relatively straightforward argument shows that M is unbounded on LP for 
all finite p. Conversely, if the Gaussian curvature of Q does not vanish to infinite order at 
qo, then by [SoSt] M is bounded on LP for some finite p. (See [CoMa] for another result of 
this kind.) It is entirely possible that in any dimension, M is bounded on some finite LP 
whenever Q is not tangent to T qo (Q) to infinite order at qo. Hence in general we expect 
Po to be finite. 

Definition 1.1. Let d(x,y) denote the vertical distance between Q and T qo (Q) above 
(x, y). The height h(qo) is defined to be the reciprocal of the supremum of all e for which 
the integral of \d(x, y)\~ e is finite on at least one neighborhood of q . 

For the case n = 2 considered in this paper, a good LP boundedness theorem for 
p > 2 was proven in [IkKeMu2]. Their main theorem can be stated as follows. 

Theorem [IkKeMu2] . Suppose the origin is not contained in T qo (Q) . If <fi(q) is supported 
on a sufficiently small neighborhood of qo then M is bounded on LP for p > max(h(qo) , 2). 
When h(qo) > 2 and (j)(qo) ^ this exponent is sharp in that M is unbounded on LP for 
p < h(qo) and if Q is real-analytic, then M is unbounded on L h ( qo ^ as well. 

The purpose of this paper is to provide a relatively short alternative approach to 
the p > 2, n = 2 situation by extending the methods of [Gl] and using facts about the 
adapted coordinate systems described below. There will once again be exceptional cases 
not covered, but due to the differences in the methods the exceptional cases will be quite 
different from and not mutually exclusive to the exceptional cases of [IkKeMu2], which 
occur when G T qo (Q). 

Newton Polygons and Adapted Coordinates. 

We now give some relevant terminology which will be used throughout this paper. Below, 
R(x, y) denotes a smooth function defined on a neighborhood of the origin with nonvan- 
ishing Taylor expansion at the origin. 

Definition 1.2. Let R(x,y) = ^ a b R a bX a y b denote the Taylor expansion of R(x,y) at 
the origin. For any (a, b) for which R a b ^ 0, let Q a b be the quadrant {(x,y) G R 2 : x > 
fl, y > b}. Then the Newton polygon N(R) of R(x,y) is defined to be the convex hull of 
the union of all Q a b- 

In general, a Newton polygon consists of finitely many (possibly zero) bounded 
edges of negative slope as well as an unbounded vertical ray and an unbounded horizontal 
ray. 

Definition 1.3. The Newton distance d(R) of R(x,y) is defined to be mf{t : (t,t) G 
N(R)}. 
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One often uses (ti,t 2 ) coordinates to write equations of lines relating to Newton 
polygons, so as to distinguish from the x-y variables of the domain of R(x, y). The line in 
the 1 1-^2 plane with equation t\ = t 2 comes up so frequently it has its own name: 

Definition 1.4. The bisectrix is the line in the t\-t 2 plane with equation t\ = t 2 . 

A key role in the above theorems as well as our theorems to follow is played by 
the following polynomials. 

Definition 1.5. Suppose e is a compact edge of N(R). Define R e (x,y) by R e (x,y) = 
S(a b)ee RabX a y b . In other words R e (x, y) is the sum of the terms of the Taylor expansion 
of / corresponding to (a, b) G e. 

Definition 1.6. Suppose R(x,y) has nonvanishing Taylor expansion at the origin such 
that _R(0, 0) = and Vi?(0, 0) = 0. Then R(x, y) is said to be in nonadapted coordinates if 
the bisectrix intersects N(R) in the interior of a compact edge e of N(R) such that R e (l, y) 
has a zero of order greater than d(R). If R(x,y) is not in nonadapted coordinates, then 
R(x, y) is said to be adapted coordinates. 

The significance of adapted and nonadapted coordinates was first discovered by 
Varchenko [V] for the real-analytic case and for the general smooth case by Ikromov-Muller 
[IkMu]. Namely, define eo to be the supremum of all e for which |-R| _e is integrable in at 
least one neighborhood of the origin. Equivalently, eo is the supremum of the epsilon such 
that on some neighborhood of (0,0) one has |{(x,y) : \R(x,y)\ < t}\ < Ct e for some C. 
Then by [V] and [IkMu] one has d(R) < j^, with equality holding if and only if R{x,y) 
is in adapted coordinates. Furthermore, one has the following. Suppose R(x,y) is not in 
adapted coordinates and let e be the edge of N(R) intersecting the bisectrix in its interior. 
By [V] and [IkMu], if the slope m e of e is at least —1, then there is a smooth ift(x) with 
■i/>(0) = such that R(x, y + ip(x)) is in adapted coordinates. By switching the roles of the 
x and y axes, this means that if m e < —1, there is a smooth ip(y) with ip(0) = such that 
R(x + ip(y),y) is in adapted coordinates. Thus [V] and [IkMu] show that there necessarily 
is a " nice" coordinate change after which the growth rate of R at the origin is given in the 
above way by and where R is in adapted coordinates; these facts in turn are used in 
[IkKeMu2] in their proof of the L p boundedness properties of M in their theorem above. 
Another useful aspect of adapted coordinates proven in [V] and [IkMu] is that if one is in 
adapted coordinates, then the order of any zero of any R e (l,y) (cf. Definition 1.5) is at 
most d(R). 

Let a denote the order of the zero of R(x,y) at (0,0). Then for a generic linear 
transformation T, d£(i?oT)(0, 0) and d£(RoT)(0,0) are both nonzero. Thus the Newton 
polygon of R o T is entirely on or above the line ti+t 2 = a. But no point of N(R o T) can 
be below this line; otherwise r oT would have a zero of order less than a at the origin. We 
conclude that N(R o T) is exactly {(t±, t 2 ) G R 2 : t± + 1 2 > a}. Since the compact edge of 
N(RoT) has slope —1, by the above discussion there is a ip(x) such that RoT(x, y + ifj(x)) 
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is in adapted coordinates. Note that the Newton polygon of RoT(x,y + tp(x)) still has its 
upper vertex at (0, a) and that the slope of each edge of this Newton polygon is at least 
-1. This motivates the following definition. 

Definition 1.7. Suppose #(0,0) = and V J R(0, 0) = 0. Then R(x,y) is said to be in 
generic adapted coordinates if R(x,y) is in adapted coordinates, each edge of N(R) has 
slope at least —1, and N(R) intersects the y-axis at some point (0,a). 

Note that the above definition implies that R(x,y) has a zero of order a at the 
origin. We now are in a position to state the main theorem of this paper. Recall we are 
working in the situation where Q is a surface with a distinguished point go = (xo, yo, zq), 
such that near g the surface Q is the graph of some g(x,y) with g(xo,yo) = zq and 
Vg(xo,yo) = (0,0). Let f(x,y) = g(x + xo,y + yo) — zq, and define D(x,y) to be the 
Hessian determinant of / at (x, y). Then our main theorem is as follows 

Theorem 1.1. Suppose M is as defined in (1.1). If (p(q) is supported on a sufficiently 
small neighborhood of qo, then M is bounded on L p for p > max(/j(go), 2) as long as 
neither of the following two exceptional situations occurs. 

a) D{x, y) has a zero of infinite order at (0, 0). 

b) Whenever T is an invertible linear transformation and ift(x) a smooth function with 
a zero of order b > at the origin such that F(x, y) = (/ o T)(x, y + ifi(x)) is in generic 
adapted coordinates, then the bisectrix intersects N(F) in the interior of a compact edge 
e with slope m e with |m e | < \ such that p(y) = d y (F e (l,y)) or d y (F e (— 1, y)) has a zero 
of order greater than max(l, d(F) — 1) at some yo for which p(yo) ^ 0. 

Here F e (x, y) is as in Definition 1.5. If ifj(x) has a zero of infinite order at x = 0, 
then we take \ = in Theorem 1.1b); in other words, the exceptional condition of b) will 
not be satisfied. As is explained in [IkKeMu2], using a theorem in [IoSal] one can show 
the exponent h(qo) is best possible when go ^ T qo (Q), assuming 4>{qo) ^ 0. If go € T qo (Q), 
then sometimes one can do better as the maximal operator starts to resemble a traditional 
Hardy-Littlewood maximal operator in two dimensions. 

The exceptional situation a) of this paper is necessitated by our use of damping 
functions in conjunction with the theorem of Sogge and Stein [SoSt] that we will describe 
below. It is unlikely that it can be avoided without using substantial additional ideas. A 
canonical example of when the first kind of exception situation occurs is when F(x, y) = 
p(ax + by), in other words, when F(x, y) is effectively a function of one variable. 

Exceptional situation b) may be viewed as rare in the sense that it requires a 
certain polynomial to have a zero of high order among other things; however, a simple 
example of where it happens is the function (y + x a ) b + x c for c > b > 2, a > 1, and ab < c. 
Although we will not prove it here, further manipulations involving Newton polygons 
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can be used to show that if the exceptional condition holds for one of d y (F e (l, y)) and 
d y (F e (—l, y)), it holds for the other. Hence in the statement of Theorem 1.1 we could 
have just used one of the two functions. 

The exceptional situation b) arises for the following reason. The main theorem of 
[Gl] gives Theorem 1.1 if f(x,y) is already in adapted coordinates. Much of the analysis 
of [Gl] carries over even in nonadapted coordinates; the y- variable shift by ip(x) does 
not interfere with most of the argument. The exception occurs when one cannot avoid 
using integrations by parts in the x- variable in arguments resembling the proof of the Van 
der Corput lemma. This happens when the bisectrix intersects N(F) in the interior of a 
bounded edge satisfying the above conditions on F e (l, y) and its y derivatives. If the order 
b of the zero of ip(x) at the origin is at least j^-p then ip(x) is too small to cause any 

serious problems in such integrations by parts. If b is less than -^j— j- then it seems difficult 
to adapt the arguments of [Gl] to these situations. So as long as we can find some linear 
T such that / o T avoids such situations, then the methods of [Gl] can be adapted to the 
current situation and Theorem 1.1 can be proved. 

The stipulation that b < j^-j- is more than a technical improvement in the state- 
ment of Theorem 1.1. For example, if f(x,y) is a mixed homogeneous function, then it is 
not hard to show that in converting to adapted coordinates ip(x) can always be taken to 
be of the form cx b , and if the bisectrix of F(x,y) = f(x,y + cx b ) intersects N(F) in the 
interior of a compact edge of slope m e then b = Hence Theorem 1.1 covers the mixed 

homogeneous case (as long as D{x,y) does not have a zero of infinite order at the origin) 
because we include the condition. For h(q ) > 2 this result was first proved in [IkKeMul]. 

Strategy of the Proof. 

Ever since [SoSt] one successful method of proving LP boundedness theorems for 
maximal operators such as (1.1) has involved embedding M in an analytic family M z . 
The idea is that one replaces the standard surface measure da(q) with the damped surface 
measure e z \h\ z da(q), and then defines the maximal operator M z to be the analogue of 

2 

(1.1) with da(q) replaced by e z \h\ z da(q). One shows that for some so < 0, M z is bounded 
on L°° whenever Re(z) > so, uniformly in \Im(z)\ for fixed Re(z), and that for some 
si > 0, M z is bounded on L 2 whenever Re(z) > s±, again uniformly in \Im(z)\ for fixed 
Re(z). The e z factor is present to ensure uniform L 2 bounds on each vertical line. Then 
by a well known interpolation technique for maximal operators (see Ch 11 of [St2] for 
details), one obtains a po > 2 such that M = Mq is bounded on LP for p > po. The hope 
is that the damping function h(z) can be chosen so that po is optimal. 

In the above interpolation, the L°° bounds are typically obtained using the ob- 
servation that | \M Z \ l^oo^^oo is bounded by the L 1 norm of C\h\ Re ^ z \ Thus as long as 
\h\ s °da(q) is a finite measure, one obtains the desired uniform L°° bounds on any vertical 
line Re(z) = s for s > s . For the L? boundedness, we will use the following consequence 



5 



of a theorem of Sogge and Stein: 

Theorem 1.2. [SoSt]. Suppose the surface measure (p(q)da(q) is as in (1.1) and there 
are C, e > such that for all multiindices a with \a\ = 0, 1 the Fourier transform of the 

2 

measure da z (q) = e z \h(x)\ z 4>(q)da(q) satisfies 

|<9<V,(A)|<C(1 + |A|)-^ (1.2) 
Then there is a constant C depending on C and e such that ||M 2 /|| 2 < C'||/||2 for all 

In practice, if one has (1.2) for a = 0, it will generally automatically hold for 
all the first derivatives since the effect of taking such a derivative is to replace the cutoff 
function by another one. In the proof of Theorem 1.1 in this paper, we will work in this 
framework. We will first write the surface Q near q as the union of finitely many " slivers" 
containing go on their boundaries. These slivers will be defined using the Newton polygon 
of the function f(x, y) defined above Theorem 1.1 when put in generic adapted coordinates; 
we will effectively be doing a coarse resolution of singularities using Lemmas 2.2 and 2.3 
of the next section. We will then define the damping function separately on each sliver. 
The damping functions will be analogues of those of [Gl]. There will be four types of 
slivers and showing (1.2) holds will be done by separately estimating the contribution to 
ef z (A) coming from each type of sliver, again using methods analogous to those of [Gl]. It 
should be pointed out that the idea of dividing a neighborhood of a point into slivers with 
respect to a Newton polygon for the purpose of proving oscillatory integral estimates such 
as (1.2) is quite old; it appears in [PSt] and its predecessors for example and was also used 
in analyzing such maximal operators in [IkKeMul] and [IkKeMu2] . 

2. Lemmas about Newton polygons; subdivisions into slivers 

In the analysis of this paper, for the appropriate F(x, y) one treats on similar 
footing all compact edges of N(F) intersecting the set {(^1,^2) : > t±}. To avoid 
exceptional situations such as those of part b) in the statement of Theorem 1.1 for any 
such edge not intersecting the bisectrix in its interior, we have the following lemma. 

Lemma 2.1. Suppose R(x,y) is a smooth function on a neighborhood of the origin that 
is in generic adapted coordinates. Let d* denote max(2, d(R)). Suppose e is any compact 
edge of N(R) lying entirely on or above the bisectrix. Then if there is yo 7^ such that if 
_R e (l, yo) 7^ and d y R e (l, y) has a zero at yo of order greater than d* — 1, then e has slope 
— 1 and upper vertex lying on the y axis. 

Proof. Since R(x, y) is in generic adapted coordinates, the uppermost vertex of N(R) 
is (0, a) for some a > 2. The point (d(R),d(R)) is on N(R), and the line of slope — 1 
containing this point intersects N(R) at (0, 2d(R)). Since all edges of N(R) have slope at 
least -1 and (d(R),d(R)) G N(R), we conclude that a < 2d(R) with equality holding iff 
there is an edge of N(R) connecting (d(R),d(R)) to (0, 2d(i?)). Hence if e is a compact 
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edge of N(R) lying entirely on or above the bisectrix, either e is the segment (d(R) , d(R)) 
to (0,2d(R)) or a < 2d(R). Thus if e satisfies the assumptions of this lemma and we can 
show that R e (l, y) has degree at least 2d(R), then in particular e has slope — 1 and upper 
vertex lying on the y axis as needed. 

So assume e is a compact edge lying entirely on or above the bisectrix such that 
there is yo 7^ with R e (l, yo) 7^ and such that d y R e (l, y) has a zero at yo of order greater 
than d* . First consider the case where e's lower vertex is (d(R), d(R)). In particular, d(R) 
is an integer. We will show that R e (l,y) has degree at least 2d(R). Note that R e (l,y) 
has a zero at y = of order d(R) and therefore d y R e (l,y) has a zero at y = of order 
d(R) — 1. Since we are assuming it also has a zero at yo of order greater than d(R) — 1, 
d y R e (l,y) must have degree least 2d(R) — 1. Hence R e (l,y) has degree at least 2d(R). 
This means that the upper vertex of e is at least 2d(R) and by the previous paragraph we 
are done. 

Next, we consider the case where e does not contain (d(R),d(R)). Hence e lies 
entirely above the bisectrix, and i? e (l, y) can be written as y d p(y) where d' > d(R). This 
means that d y R e (l, y) is of the form y d ~ 1 q(y). Since d y R e (l, y) is assumed to have a zero 
yo 7^ of order greater than d(R) — 1, we can write 

d y R e (l, y) = y d '-\y - y ) d " _1 r(y) (2.1) 

Here d" > d(R). Note that (2.1) implies that the degree of d y R e (l, y) is at least d'+d"— 2 > 
2d(R) — 2, so the degree of R e (l, y) is greater than 2d(R) — 1. Hence the upper vertex of e 
must be the upper vertex of N(R); otherwise N(R) would have a vertex at height greater 
than 2d(R) and since a < 2d(R) this can't happen. We conclude the upper vertex of e is 
given by (0, a) for some 2d(R) — 1 < a < 2d{R). As a result, the degree of d y R e (l,y) is 
greater than 2d{R) — 2 and at most 2d(R) — 1. 

If r had positive degree, then the degree of d y R e (l, y) would be at least d'+d"— 1 > 
2d(R) — 1, contradicting the above. So r{y) is constant and we may write 

d y R e (l,y) =cy d '-\y-y ) d "- 1 (2.2) 

Similarly, if d' or d" were equal to d(R) + l or greater then the degree of d y R e (l, y) would be 
greater than 2d(R) — 1, again giving a contradiction. So we have d(R) < d', d" < d(R) + 1. 
Since d' and d" are both integers, this means d' = d" . "Homogenizing" (2.2), for some k 
we get that 

d y R e (x,y) =cy d '- 1 (y-y x k f~ 1 (2-3) 

Looking at the term of (2.3) whose degree in y is second-highest we see that k is an integer. 
Hence e is an edge of slope — ^ whose upper vertex is (0,2d' — 1) and our objective is to 
show that k = 1. Assume k > 2; we will arrive at a contradiction. Since we are dealing 
with the case that e lies entirely above the bisectrix, the point (d(R),d(R)) lies above the 



7 



line containing e. Since this line intersects the bisectrix at (j^(2d' — 1), -j^{2d' — 1)), we 
conclude that 

^(2d' - 1) < d(R) (2.4) 
Since d! > d(R) and > |, this in turn implies that 

|(2d(i2)-l) < d(i?) (2.5) 

Equivalently, d(R) < 2. Since d! < d(R) + 1 and cZ' — 1 is an integer at least one, d! — 1 = 1 
and (2.3) just becomes 

d y R e (x,y) = cy(y-y x k ) (2.6) 

Now note that d y R e (l, y) no longer has a zero of order greater than one, so it no longer falls 
under the assumptions of this lemma. So this situation cannot happen; we have arrived at 
a contradiction and we are done. 

Lemma 2.2. Suppose R(x, y) is a smooth function on a neighborhood of the origin such 
that -R(0, 0) = 0. Suppose v is a vertex of N(R) that is the intersection of compact 
edges ei and e 2 with slopes > m\ > m^- Let M\ = — — and M 2 = — — . Let 
Rcd,x c y d denote the term of the Taylor expansion of R(x, y) at the origin corresponding to 
v. Then on a sufficiently small neighborhood of the origin, there is an N > such that if 
A^|x| Ml < \y\ < jj\x\ M2 , then we have 

\\RcdX c y d \ < \R(x t y)\ < 2\R cd x c y d \ (2.7) 

If v lies on the y axis and is the upper vertex of a compact edge e± with slope — then 
there is an iV > such that if iV|a;| Ml < \y\ then once again (2.7) holds. 

Proof. Without loss of generality, we may restrict our attention to (x, y) in the upper right 
quadrant. Write the Taylor expansion of R{x,y) at the origin as ^ ab R a bX a y h '. We first 
prove (2.7) in the case where v is the intersection of two compact edges, whose equations 
we denote by t\ + Mit 2 = «i and t\ + M 2 t2 = «2- For a large K we can write 

R(x, y) - R cd x c y d = ^ R ab x a y b 

(a,b):c<a<M, d<b<M, (a,b)^(c,d) 

+ R ab x a y b + Yl R ab x a y b + E K (x,y) 

(a,b):a<c, d<b<M, a+M 2 b>ct 2 (a,b):c<a<M , b<d, a+Mi6>ai 

(2.8) 

Here E K (x,y) satisfies 

\E K (x,y)\<C(\x\ K + \y\ K ) (2.9) 

We start by noting that the first sum in (2.8) is less than \\r c d\x c y d in absolute value if 
(x, y) is in a sufficiently small neighborhood of the origin, which we may assume. As for 
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the second sum, if one changes coordinates from (x,y) to (x,y r ), where y' = x M2 y, then 
(x, y') G [0, 1] x [0, jj] whenever y < jjX M2 . Observe that under this coordinate change, a 
given term R a \,x a y b of the second sum becomes R a bX a+M2b (y') b . Since a + M26 > oli and 
b > d in each term in the second sum, the entire sum can be written as x a2 (y') d (y' 'f(x, y')) 
for some f(x,y') which is a polynomial in y' and a fractional power of x. Thus if N is 
sufficiently large, whenever y' < jf the sum is of absolute value less than | \R c d\x a (y') d = 
\\R c d\x c y d . Since y' < is equivalent to y < jjX M2 , these are the bounds we need. 

The third sum is dealt with in exactly the same way, reversing the roles of the 
x and y axes and the edges e\ and e^. Lastly, since jjX M2 > y > Nx Ml the error term 
Er(x, y) is less than \\r c d\x c y d in absolute value for small \y\ if K is chosen sufficiently 
large. Putting these all together, we get that \R(x,y) — R c dX c y d \ < \\R c d\x c y d as needed. 
This completes the proof of Lemma 2.2 for the case where (c, d) is the intersection of two 
compact edges of N(R). 

We now move to the case where (c, d) is on the y-axis and is the upper vertex of a 
compact edge e\ of N(R). We again examine the sum (2.8). In the case at hand, since (c, d) 
is on the y-axis, the second sum in (2.8) is empty. The second sum is where the condition 
y < jjX M2 was used above, and the third sum is where the condition y > Nx Ml was used. 
Since we have no second sum, the lack of a condition y < j^x M2 holding does not cause 
any problem in repeating the above argument. For the third sum we use the condition 
y > Nx Ml exactly as before, and for the first and fourth sum the previous argument works 
unmodified. Hence (2.7) holds again and we are done. 

Lemma 2.3. Suppose R(x, y) is a smooth function of y and a fractional power of x on a 
neighborhood of the origin such that -R(0, 0) = 0. Write the Taylor expansion of R(x, y) at 
the origin as b R a bX a y b . For a given M > let Rm(x, y) denote the sum of the nonzero 
terms of this Taylor expansion for which a + Mb is minimized; in particular RM(x,y) is 
either of the form R e (x, y) for a compact edge e of N(R) or is equal to R c dX c y d for a vertex 
(c, d) of N(R). Denote this minimal value of a + Mb by a. Then for any r G R and any 
e > 0, there is a 5 > such that on the set {(x, y) G R 2 : < x < 5, (r - 5)x M < y < 
(r + S)x M } we have 

\R(x,y)-R M {x,y)\<ex a (2.10) 

Proof. On the region {(x, y) G R 2 : < x < 5, (r — 5)x M < y < (r + 5)x M } : we do the 
coordinate change (x, y) = (x, x M y'), converting the region into the box (0, 5) x (r— 5, r+S). 
In the new coordinates, the finite Taylor expansion R(x, y) = J2 a ,b<K 

R ab x a y b + 0(\x\ K + 

\y\ K ) becomes of the form 

R(x, x M y') = x a R M (l, y') + x a +<s(x, y') + 0(\x\ K + \x\ KM \y'\ K ) (2.11) 

Here £ > and f(x, y') is a polynomial in y' and a fractional power of x. For any e' > 0, if 
5 is sufficiently small we have \Rm(1, y') — Rm(1, r)\ < e' for all \y' — r\ < 5. Equivalently, 
|i?M(l,y / ) — RM(l,r)\x a < e'x a . Furthermore, if 5 is sufficiently small x a+< ^\s(x 7 y')\ and 
the 0(\x\ K + \x\ KM \y'\ K ) term are less than e'x a whenever x and \y' — r\ are sufficiently 
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smalll. Combining, if 5 is sufficiently small then on our domain we have 

\R(x,x M y') -x a R M (l,r)\ < 3e'x a R M (l,r) (2.12) 

Translating this back into the original coordinates, we have 

\R{x, y) - R M {x, y) \ < 3e'x a R M {l, r) (2.13) 

Taking e = 36'Rm(^,t) gives us the lemma and we are done. 

We now are in a position to set up the proof of the main theorem, Theorem 1.1. 
Recall we have a surface Q with a distiguished point (xq, yo, zq) that is the graph of some 
smooth function g(x,y) defined near (xo,yo) such that V^(0,0) = (0,0). Suppose the 
assumptions of Theorem 1.1 hold. Then, after a linear coordinate change if necessary, we 
may assume f(x, y) = g(xo + x, yo + y) — zq has a generic adapted coordinate system on a 
neighborhood of the origin for which the exceptional situation b) of Theorem 1.1 does not 
occur. Therefore there is a smooth tfj(x) with i(j(0) = such that F(x, y) = f(x, y + if}{x)) 
is in generic adapted coordinates, and if N(F) intersects the bisectrix in the interior of a 
compact edge e then e does not satisfy the exceptional situation b) of the statement of 
Theorem 1.1. 

Definition of slivers for F(x 7 y) 

We now use Lemmas 2.2 and 2.3 on F(x,y) and its various y derivatives to 
subdivide a small neighborhood B of (0,0) into "slivers" containing the origin. The case 
where N(F) has exactly one vertex (which is therefore on the y-axis) is easier and will be 
treated separately, so in the following we always assume N(F) contains multiple vertices. 
Denote the vertices of N(F) above the bisectrix by vi,...,Vk where if i < j then Vi is 
below Vj. Let denote the edge of N(F) whose upper vertex is vf, if vi is the lowest 
vertex of N(F) then we just do not define e±. Let rrii = —j^- denote the slope of e^. 
Write Vi = (a^, bi); observe bi > 2 for all i since (eij, bi) lies above the bisectrix and we are 
assuming F(0,0) = and VF(0,0) = (0,0). 

For Vi that is the intersection of two compact edges of N(F), define Di to be the 
set {(x,y) E B : N \x\ Mi < \y\ < 7^-|^| Mi+1 }- Here iVo is large enough so that we may 
invoke Lemma 2.2 and say there are cq, c\ > such that (assuming B is small enough) for 
m = 0, 1, 2 on Di we have 

am rp 

ci|*Hy| 6< -™ > \-^(x,y)\ > c \xr\y\ b *- m (2.14) 

It should be pointed out that if bi = m = 2, then (2.14) holds by applying Lemma 2.3, 
reversing the roles of the x and y axes and setting r = 0. If Vi is the upper vertex of N(F), 
we define Di to be the points where ATol^l^ < \y\, in which case (2.14) still holds on Di 
by Lemma 2.2. 
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We next subdivide the set B — UiDi into some slivers touching the origin amenable 
to the analysis of this paper. We only describe the slivers for x > 0; the ones where x < 
are defined analogously. Note that the points of B — Ui-Dj where x > can be written as 
UjCi, where 

d = {(x,y) E B : x > 0, \y\ < N x Ml } (2.15a) 

d = {(x, y)eB:x>0, ^-x Mi < \y\ < N x Mi } (i > 1) (2.156) 

(In the event that v\ is the lowest vertex of N{F), one takes M 2 in (2.15a) and then 
(2.156) is valid for i > 2). Suppose r is such that F ei (l, r) ^ 0, but d y F ei (l, r) has a zero 
of order greater than d* — 1 = max(2, d(F)) — 1 at r. Then if intersects the bisectrix in 
its interior, by assumption the exceptional case of Theorem 1.1 part b does not occur. If 
ej does not intersect the bisectrix in its interior, then by Lemma 2.1 has slope -1 and 
intersects the y-axis. In either event, if i = 1 and \r\ < N or i > 1 and < |r| < iVo, we 
define E ir to be the sliver 

E ir = {(x,y) E B : x > 0, (r - 5 r )x Ml < y < (r + 5 r )x M *} (2.16) 

Here 5 r is a small constant to be determined by our future arguments. We will refer to 
the (finitely many) Ei r occurring as Eij in the rest of this paper. 

For any r other than these, we may let 1 < k < d* and 5 r ,C r > be such 
that on [r — 5 r ,r + S r ] we have \dyF e . (1, y)\ > C r . If F e . (l,r) 7^ this follows from 
the above definition of the and if F e .(l,r) = it follows from the fact that any zero 
of any F e . (l,y) has order at most d(F) in adapted coordinates. As a result, on the set 
B r = {(x, y) G -B : a; > 0, (r — S r )x Mi < y < (r + 5 r )x Mi }, given that contains (a^, 6^) 
we have 

\d$F e .(x,y)\ > C r x a * +M ^- k) (2.17) 
By applying Lemma 2.3 to dyF(x,y) we can assume B is small enough that we also have 

\d k y F{x,y)-d k y F ei {x,y)\ < (218) 
Putting (2.17) and (2.18) together, on B r we have 

\d k F(x,y)\>^x a ^ +M ^- k) (2.19) 

By compactness, we can write B — UiDi — UijEij as the union of finitely many slivers on 
which (2.19) is satisfied. For a given edge e^, we write the slivers for which k = 1 as F^ 
and the slivers for which k > 1 by Gij . We denote the value of k corresponding to a given 
Gij by kij. Note that each kij < d*. 

The above decompositions were for the case where N(F) had more than one 
vertex. When N(F) has exactly one vertex, since F it is in generic adapted coordinates it 
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is of the form (0, k). In this case, we simply designate a neighborhood of the origin as a 
single Gij, with kij = k. In general, the arguments for this Gij will be simplified versions 
of the arguments for the multivertex case. 

Let v (F) denote the set of vertices of N(F), and define F*(x, y) by 



(vi,v 2 )£v(F) 

The function F*(x,y) will be used in defining the damping function. To this end, first 
note that N((F*) 2 ) is the double of N(F) and therefore d((F*) 2 ) = 2d(F). As a result, 
by [V], F*(x,y)~ l is integrable on a neighborhood of the origin iff t < ^d((F*) 2 ) = d(F). 
We apply Lemmas 2.2 and 2.3 to F*{x,y) 2 in place of F(x,y) and obtain that if the Nq 
in the definition of Di were chosen sufficiently large, then there is a constant Cq > such 
that on each Di we have 

-^\x ai y h \ < F*(x,y) < C \x ai y bi \ (2.20) 
Co 

Similarly, by Lemma 2.3 on each Eij, Fij and G^, the constant Co can be taken so that 
we have 

i-|x|° i+Mi6i < F*(x,y) < C \x\ a * +M ^ (2.21) 
Co 

We now subdivide the surface Q near our distinguished point qo in accordance with the 
above subdivisions, applied to the function f(x,y) in generic adapted coordinates. In 
other words, we let ip{x) be such that F(x,y) = f(x,y + ifj(x)) is in generic adapted 
coordinates such that the exceptional cases of Theorem 1.1 do not hold, and define Di, 
E^, F^, and Gij to be the above slivers as defined for F(x,y). We next transfer these 
slivers into the original coordinates of the surface Q; let D\ be the portion of Q above the 
set {(x, y) : (x — xq, y — yo — ip(x — xq)) G Di}, with the analogous definitions for E'-, F[j, 
and G\j. We also will have use for Di in the original nonadapted coordinates, centered 
at (xo,yo). To that end we let D" = {(x,y) : (x, y — ip(x)) G Di}, making the analogous 
definitions for E'( 3 , F/j , and G"j. 

The next lemma will be useful in bounding the contribution of our integrals over E'(j in 
the L 2 estimates of section 4. 

Lemma 2.4. There is a constant C such that on E'/j we have \d xx f(x, y) \ > C\x\ ai+hiMi ~ 2 . 

Proof. We consider slivers for which x > as the x < slivers are entirely analogous. 
Recall each Eij is a region of the form {(x, y) : < x < fx, (r — v)x Mi < y < (r + v)x Mi }, 
where F £i (1, r) ^ 0, but where d l y F ei (1, r) = for at least / = 1, 2. Under the map (x, y) — > 
(x, y — rx Mi ), the set E^ becomes the region E'/J = {(x, y) : < x < [i, \y\ < vx Mi }, and 
if G{x,y) denotes F(x,y + rx M >) then G Ci (l,0) ^ but 0'G Ci (l,O) = for / = 1,2. 
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In terms of Newton polygons, the above can be translated as follows. Since 
is an edge of N(F) with equation x + M^y = a^ + Mj&j, N(G) has an edge with the 
same equation which goes all the way to the x-axis since G ei (l,0) 7^ 0. As a result, for 
/ = 1, 2 N(d l x G) has an edge with equation x + Miy = cii + Mibi — I which extends to 
the x axis. On the other hand, since d y G ei (l, y) has a zero of order at least two at 0, for 
/ = 1, 2 N(d l y G) intersects the line with equation x + Miy = cii + (Mi — l)bi but does not 
contain (a^ + (Mi — l)bi, 0). Using this fact for / = 1 and taking an x derivative shows that 
N(d xy G) intersects the line with equation x + Miy = cii — 1 + (Mi — l)bi but does not 
contain (a^ — 1 + (Mi — 1)6^, 0). Using (2.11) in conjunction with these latter observations 
concerning the Newton polygons, we obtain that for any 77 > 0, if v were chosen sufficiently 
small, then on E"! for / = 1, 2 we have 

\d l y G(x,y)\ < n\x\ ai+ ( Mi -Q bi (2.22a) 

\d xy G(x,y)\ < ^xl'i-i+iMi-m (2.22b) 

On the other hand Lemma 2.3 in conjuction with the above conditions on the Newton 
polygons of the cc-derivatives ensures that for some c\ > 0, on E'/J for / = 1, 2 we have 

\d l x G(x,y)\ > Cl \x\ a * +M ^- 1 (2.22c) 

Next, as mentioned after (2.15), Eij is only defined in two situations. The first is when 
intersects the bisectrix in its interior and the function ifj(x) such that F(x, y) = f(x, y + 
ip(x)) has a zero of order at least Mj at x = 0. In the second situation, does not intersect 
the bisectrix in its interior, but by Lemma 2.1 Mj = 1 and thus %l)(x) still has a zero of order 
at least Mi at x = 0. In either case, since f(x, y) = F(x, y — ifj(x)) = G(x, y-ifj(x) +rx Ml ), 
we can write f(x, y) = G(x, y + £(x)) where £(x) has a zero of order at least Mj at x = 0. 
Applying the chain rule, we get that on E'(- we have 

d xx f(x, y) = d xx G(x, y + £(x)) + 2£ / (x)d xy G(x, y + £(x)) 

+e'(x)d y G(x, y + £(*)) + (£' '(x)) 2 d yy G(x, y + £(*)) 

Equation (2.22c) ensures that \d xx G(x, y+£(x))\ > ci\x\ ai+Mibi ~ 2 : and equations (2.22a) — 
(2.226) coupled with the fact that has a zero of order at least Mj at x = ensure 
that the remaining terms can be made less than any r]'\x\ ai+Mibi ~ 2 . We conclude that 
\d xx f(x,y)\ is at least £±.\ x \ ai+Mibi ~ 2 on E'(- if v were chosen appropriately small. This 
completes the proof of the lemma. 

Definition of the damping factor. 

We now define the damping factor on the surface Q in a neighborhood of qo = 
(xq, yo, g(xo, yo))- Above a point (x, y) it will be of the form e 2 \h(x, y) \ z \D(x, y)\ Sz , where 
D(x,y) is the Hessian determiant of g at (x,y) and 5 is a small positive number to be 
determined by our arguments. The function h(x, y) will be defined in the form H(x — 
xo,y — yo — ip(x — xq)), where H(x,y) is expressed in terms of F(x,y) = f(x,y + ip(x)). 
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To this end, once again let d* = max(2, d(F)). Since F(x,y) is in adapted coordinates, 
d(F) — h(q ) and thus equivalently we have d* = max(2, h(q )). 

On each Di, Eij, Fij, as well as on each Gij with kij = 2, we define H(x,y) to 
be F*(x,y)2~d r . On the remaining G^, if N(F) has just one vertex we let H(x,y) = 

|fj^(x,2/)|3, while if N(F) has multiple vertices let H(x,y) = |:r| M<_ 1 d* 1 1 (x,y)\%. 
One thing worth mentioning concerning these latter H(x,y) is the following. Since \y\ < 
C\x\ M * on Gij, by (2.11) with a = at + MA, on Gy one has < C\x\ ai+Mi <- bi - 2 \ 

As a result, \H(x,y)\ < C\x\^ +Mibi ^^~ & and in view of (2.21) this gives 

\H(x,y)\<CF*(x,y)t-& (2.23) 
Note the right-hand side of (2.23) is exactly CH(x,y) for the other regions. 
3. L°° estimates 

Define the operator M z to be the maximal operator (1.1) with respect to the 
measure e z \h(x, y)\ z \D(x, y)\ Sz 4>(q)da(q) in place of 4>(q)da(q). Note that for z = 0, 
M z is exactly M. These will be the analytic family of maximal operators used in proving 
Theorem 1.1 as described at the end of section 1. We first prove the L°° to L°° boundedness 
properties of the M z we need. 

Theorem 3.1. Write d = d(F). If d > 2, then for any s > — -j^, if 5 is sufficiently small 
(depending on s) then there exists a constant C such that | |M 2 | l^oo^^oo < C for all z with 
Re(z) = s. If d < 2, the same holds for any sGR. 

Proof. We will use the fact that | \M Z \ \ ^00^™ is bounded by the L 1 norm of the damping 
function of M z . We consider the case d = d(F) < 2 first. Note that for each Di, Eij or Fij 
the function h(x, y) is given by F*(x — xq, y — yo — ip(x — xo))^ - ^ = 1. Since in adapted 
coordinates each is at most d* < 2 here, for each G^ that may appear h(x, y) is also 

2 c 

just 1. We conclude that the damping factor always equal to e z \D(x,y)\~ . On a given 
vertical line Re(z) = s, this has magnitude bounded by C s \D(x, y)\~ Ss . Since D(x,y) is 
assumed to be of finite type at the origin, there is some e > for which \D(x, y)\~ £ is 
integrable on a neighborhood of the origin. Hence as long as 5 < -, the damping factor is 
integrable, with integral uniformly bounded on Re(z) = s. This is exactly what we needed 
to prove. 

Now suppose d > 2. On any D\, E[-, F/-, or G'^ with = 2, h(x, y) = F*(x - 
xo, y — yo — ^>(x — xo)) and thus the damping factor has magnitude F*(x — xq, y — yo — ifj(x — 
iCo))*- 5 ~d)- Re ( 2 ) \D(x, y)\ SRe ( z \ As mentioned at the end of section 2, a result of Varchenko 
says that F*(x, y)~ l is integrable on a neighborhood of the origin iff t < ^. Thus the same 

is true for F*(x— xo, y—yo—^(x—xo))~ t . As aresult, F*(x— xo, y—yo—ip(x—xo))^~^ Re ^ 
is integrable on a neighborhood of the origin iff Re(z) > — ^z^- Consequently, by Holder's 
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inequality, for fixed s > —jz^, by choosing 5 sufficiently small we have that when Re(z) = s 
the damping factor is integrable over any D'^ E'^, F^, or G\j with kij = 2, with integral 
uniformly bounded in Im(z). This is what we need here. 

We now move on to the G\a with kij > 2. If N(F) has one vertex, we use the fact 

that 7fjjr(x,y) has nonvanishing (k^ — 2)th derivative in the y direction, where kij < d. 

Thus if Re(z) = s > — -^z^, then \H(x, y)\ z = \ ^j§-(x,y)\% is integrable in y with integral 
uniformly bounded in Im(z). Making 5 sufficiently small and using Holder's inequality 
again gives the desired result. Suppose now N(F) has multiple vertices. We will consider 
those Gij for which x > as the Gij for which x < are done in the same way. In the Gij 
coordinates we can write the damping function as 



I o-i + Mib, Q^F 

\D(x,y)\ 5z x M ^^r^\^(x,y)\ 



Recall that Gij is of the form {(x, y) : < x < rj, (r — 5)x Mi < y < (r + 5)x Mi }. Analogous 
to (2.1 1 ), on the box (0, 77) x (r — 5, r + 5) for any K one can write 

F(x,x m Y) =x ai F M .(l,Y) + x a * +c s(x,Y) + 0(\x\ K + \x\ KM *\Y\ K ) (3.1a) 

Here s(x, Y) is a polynomial in Y and a fractional power of x. Analogous expressions hold 
for various y derivatives of F; for example, we can write 

^(x,x Mi Y) = x Qi - 2M< ^F Mi (l,y)+^- 2Mi+c S >,y)+0(| : r| K +| a ;| KM< |y| K ) (3.16) 

The constant 5 was chosen small enough that \dy ij F Mi (l, Y)\ > C on (r - 5, r + 5) for 
some positive C, where kij < d. As a result, shrinking r\ if necessary we can assume that 

d 2 F 



on (0, rj) x (r - 5, r + 5) we have x Mi Y) = x ai 2Mi S(x, Y) where 



\dy' 3 2 S(x,Y) \ > C (3.2) 

We now let x = so that ^(X 1 ^ 1 , X^+^Y) is of the form X M i+ 1 T(X,Y) 

where 

\dy J ~ 2 T(X : Y)\ > C (3.2') 

We make this coordinate change so that the change from (x, y) coordinates to (X, Y) has 
constant Jacobian determinant. This ensures that a power of the damping function is 
integrable in the (x, y) coordinates iff it is integrable in the (X, Y) coordinates. In the 

old coordinates, the damping function is \D(x, y)\ Sz times x^ 1 a — ) z | @^(x, y) \i so 
in the new coordinates it is of the form 



(J 1 ' (i If, b ■ -2M 4 



\D(X, Y)\ Sz x X Z \X^+^^(X, Y)\- 
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d^--a i -M i b i fp' r P 

= \D(X,Y)\ Sz x X *\__( X ,Y)\t (3.3) 

Note that by (3.2') the function §^(X, Y) has (fc^- — 2)th derivative uniformly bounded 
below in the y variable, for fixed x, where k{j < d. As a result, if Re(z) = s > —^2, then 

Y)\% is integrable in y with integral uniformly bounded in Im(z). Thus for such 
z we have 

/ / \ x { d( M i+ i) )\*\-—(X,Y)\*dxdy<C x s "c^+d cte (3.4) 

</r-<5 JO io 

This will be finite if the exponent of x is greater than —1. Substituting aij = cti + Mibi, the 
exponent in (3.4) is s^ a \+^ bi . Note that a \+^ i&i is the ^-coordinate of the intersection 
of the bisectrix with the edge e i: which is at most d. Hence ° i+m ~ < Therefore 
if s > the exponent in (3.4) is greater than —1 and thus the right-hand factor of (3.3) 
is integrable for Re(z) = s, uniformly in Im(z). As in the previous argument, by making 
5 in the \D(X, Y)\ 5z factor sufficiently small by Holder's inequality the same will be true 
for the entire damping factor (3.3). This what we needed to prove and we are done. 

4. L 2 estimates. 

We now move to proving the L 2 bounds needed in the proof of Theorem 1.1. 
As indicated in section 1, we will be utilizing Theorem 1.2 that follows from [SoSt]. Let- 
ting (f)(q)da(q) be the surface measure of (1.1), we define the measure a z by da z (q) = 
e z \h(x,y)\ z \D(x,y)\ (p(q)da(q). Since we will using Theorem 1.2, we examine its Fourier 
transform cf 2 (A), given by 

a z (\) =e z " J e- lXl9{x > y) - lX2X - lX3y \h(x,y)\ z \D(x,y)\ Sz (j)(x,y)dxdy (4.1) 

Here <J)(x, y) denotes some cutoff function on a neighborhood of the origin. We always shift 
by (xo, 2/0)7 so that our integrals are over a small neighborhood of the origin. Thus up to 
an ignorable factor of magnitude 1, (4.1) is given by 

a z (\) =e z2 J e- lAl/(x ' y) - jA2X - lA3y |^(x,y-^(x))| z |D(x,y)| 5 ^*(a;,y)^rf ? / (4.2) 

In (4.2), tfj(x) is the function taking f(x,y) into its adapted coordinates and D(x,y) now 
denotes the Hessian determinant of / at (x, y). For the analysis of the part of (4.2) coming 
from the D", F/j, and G"a, we will transfer into the adapted coordinates of f(x, y) and use 
Van der Corput-type arguments in the y-variable. For the E'/j we will remain in the original 
coordinates, and use a Van der Corput-type argument in the x-variable in conjunction with 
Lemma 2.4. 

We will prove that the conditions of Theorem 1.2 hold by virtue of the following 
theorem, whose proof will comprise most of the rest of this paper. 
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Theorem 4.1. Suppose s > 1. Then if the constant S used in the exponent of \D(x,y)\ is 
sufficiently small, then there are constants C, e independent of Im(z) such that if Re(z) = s 
then for any multiindex a with \a\ = 0, 1 we have 

\d a a z (X)\ < C(l + |A|)-i" e (4.3) 

Proof. We will only prove (4.3) for \a\ = as the \a\ = 1 cases are identical other than 
having a different cutoff function (f>(x,y). Recall that we are assuming that /(0, 0) = 
and V/(0, 0) = 0. So if | A2 1 or | A3 1 is the maximal |Aj| one may integrate by parts in x 
or y respectively and get that |<x(A)| < C|A| _1 , which is better than the estimate that we 
need. Hence for the remainder of this paper we will always assume that |Ai| is at least as 
large as | A2 1 and | A3 1 . 

Let a(x) be an even function on R that is equal to 1 for \x\ < |, zero for \x\ > 1, 
and is monotone decreasing on R + . Let f3(x) = 1 — a(x). For constants Si and Ni to be 
determined by our arguments, we express (4.2) as Ii + 12 + ^3, where 

7i(A) = e z " J e -^f^v)-^-^y\H{x, y - i(j(x))\ z \D(x, y)\ 5z 

xa(\X\ Nl D(x,y))<f)*(x,y) dx dy (4.4a) 
I 2 (\)=e z2 J e- lXlf{x > y) - lX2X - %X3y \H(x,y - i(j(x))\ z \D(x,y)\ 5z 

x (a(\X\ Sl D(x, y) - a(\X\ Nl D(x, y)))<j>*(x, y) dx dy (4.46) 
7 3 (A) = e 2 ' J e - iXlfM - iX2X - iX * y \H(x, y - ifj(x))\ z \D(x, y)\ 5z 

xP(\X\ s iD(x,y))r(x,y)dxdy (4.4c) 

The analysis of /2(A) will be the crux of the argument. The contribution to (4.3) 
due to ii(A) is easily shown to decrease rapidly in |A|. Specifically, since D(x,y) is being 
assumed to be of finite-type in a neighborhood of the origin, if Ni is large enough the 
measure of the points where \D(x,y)\ < \X\~ Nl will be less than ^y. As a result, the 

integrand of ii(A) is nonzero on a set of measure at most . Since all the factors in (4.4a) 

are uniformly bounded on a line Re(z) = s with s > 1, this gives that |ii(A)| < C|A| _1 , 
better than what is needed. 

Bounding |/ 3 (A)|. 

Note that on the support of the integrand of /3(A), the Hessian determinant 
D{x,y) is at least i|A|~ 51 . The idea is that if Si were actually zero, then on this support 
the Hessian would be bounded below and we would get an estimate (/3(A) | < C|A| _1 . 



17 



Although 5i is not zero, if it is sufficiently small we still get an estimate |i" 3 (A) | < C|A| 1 * 
for any given small t, an estimate better than what is needed. 

We proceed as follows. For a sufficiently small c > to be determined by our 
arguments, we divide the support of the integrand of /3(A) into squares of diameter c|A| _(51 . 
We will show that the contribution to /3(A) from each such square is at most c|A|~s if § 1 
is sufficiently small. Adding this over all these squares, this gives an estimate better than 
needed. 
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Let S be any such square. Since D(x, y) is of finite type, we may let u and v 
be nonparallel directions such that for some k, d^D{x,y), d^D{x,y), and d u d^~ 1 D(x,y) 
are nonvanishing on the support of the integrand of /3(A). We can similarly assume that 
there are k' , k" such that d% (F*(x, y - tfj(x)) 2 ), d%' (F*(x, y - ifj(x)) 2 ), d%" f(x,y), and 

f(x, y) are nonvanishing on any S. Let a\ = — d u (-£x+ y^y) and <22 = — d v (j^-x+ j^y)- 
Note a\ and are constants. Define the sets Si, 52, and S3 by 

Si = {(x,y) e S : \d u f(x,y) - a x \ > |A|"*} (4.5a) 

\ (4.56) 
\ (4.5c) 

Correspondingly, write the contributions to /3(A) from Si, S2, and S3 as Ji(A), ./2(A), 
and ./3(A) respectively. To analyze Ji(A), integrate the integrand of (4.4c) by parts in 
the u direction, integrating Xi(d u f(x,y) — ai)e~' lXl ^ x,y ^~' lX2X ~ lX:iV in and differentiating 
x 1 (d f(x jy)-ai) times the remainder of the integrand. We get several terms depending on 
where the derivative lands. If it lands on the 4>(x,y) factor, then each factor in the term 
is bounded above by a constant, with the exception of the x , a ,} — s r factor, which is 

bounded in absolute value by | Ai | 3 on Si. Hence |«/i(A)| is at most C|Ai|~3+ 2<5l 7 which 
is bounded by C|A| _ s, the desired estimate. 

Next, we consider the term where the derivative lands on the , ,» rr — s r fac- 

tor. We take absolute values of the entire integrand and bound it above by m m d f 7^ x ^ ^ ■ 

to J \M (d u f{x,y)-a 1 ) 2 

We integrate this in the u direction as in the proof of the Van der Corput lemma; the as- 
sumed condition that \d* f(x, y) \ is bounded below ensures that we integrate over bound- 

edly many intervals on which d u f(x, y) — a± is monotone and thus (q /(x^'-i^a integrates 

back to (a f(x y )- ai ) ■ We end out with a bound of C| Ai | — §+ 51 < C|Ai|~§. 

If the derivative lands on the \D(x, y)\ Sz factor, we argue similarly. We take 
absolute values and integrate in the u direction, this time using that \d^D(x, y) | is bounded 
below on the integrand to ensure that there are boundedly many intervals on which D(x, y) 
is monotone and thus on which we can integrate back its u-derivative. In the (extremely 
rare) case that only k = can be used, \D(x, y)\ Sz is a smooth function and the term 
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behaves as in the case where the derivative lands on (f)(x,y). If the derivative lands on 
(3(\X\ Sl D(x 1 y)) the argument we just used for the \D(x, y)\ Sz case works. One thing worth 
pointing out is that in these cases the presence of the z in the exponent leads to an 

2 

additional factor of C\Im(z) \ upon differentiation; however, the presence of the e z in the 
damping factor is more than enough to compensate. 

Lastly, we consider the case where the derivative lands on the factor \H(x,y — 
ip(x)) | 2( ^2 ~d*\ Since this factor was defined differently on the different Di, Eij, etc, we 
split the square S into its intersections with the D", E'/p F/j, and G'/j. For anything other 
than a G"j with kij > 2, the damping factor is a power of F*(x,y — ip(x)). The directions u 
and v were defined so that F*(x, y — ip(x)) 2 has some nonvanishing higher order derivative 
in the u and v directions, so one can argue as above, breaking up the one-dimensional 
integration in the u or v variables into boundedly many intervals on which F*(x, y — t(j(x)) 
is monotone. 

On a G'lj with > 2, the damping factor was defined as x Mi ~ l+ d* 1 1 d yy f(x, y). 
We can actually assume that u and v are such that the (kij — 2)th u and v derivatives 

of x u d yy f{x,y) are nonvanishing. To see why, first note that (2.19) gives that 

the (k^ — 2)th y-derivative of d yy f(x,y) is bounded below by Cx ai+Mi ^ bi ~ kij \ On the 
other hand, by Lemma 2.3, (remembering that Mi is always at least 1 in generic adapted 
coordinates) on G'^ we have \d a f(x,y)\ < (j^+ M * b * M *M_ Using these facts with the 
product rule , if u and v are close enough to the y direction, the (k^ — 2)th derivative in 

the u or v direction of x u 3* d yy f(x,y) will also be nonvanishing. Hence one can 
argue as in the previous paragraph and get the same upper bounds as before. We have 
now considered all possible places the derivative lands, concluding the proof of the desired 
upper bounds for |Ji(A)|. 

The bounds for | »/2 (A) J are proven exactly as they were for | Ji(A)|, replacing the 
roles of the u and v variables. The presence of the added condition \d u f(x, y) — ai\ < |A| _ s 
in the domain, which does not have an analogue above, does not interfere with any of the 
above estimates; the condition that d u d^~ 1 D(x,y) is nonvanishing ensures that in any of 
the situations where one takes absolute values and does a Van der Corput type argument 
in the v direction, one still has boundedly many intervals. 

We now move on to Ja(A). Consider the level sets of d u f(x,y) and d v f(x,y). 
The gradients of both functions are bounded below in absolute value by C\D(x, y)\, which 
is at least ||A| -51 on the square S if we chose the constant c in the diameter c|A| _<51 of the 
squares sufficiently small. As a result, if c is small enough the level sets of both d u f(x 7 y) 
and d v f(x,y) do not self-intersect on S. Hence we may use d u f(x,y) and d v f(x,y) as 
coordinates on S. In particular, we may evaluate the measure of the set ^3 of (4.5c) by 
changing into these coordinates in the integral of its characteristic function. The result is 

I S 3 1 < Cmin| J D(a:,y)r 1 |Ar5 (4.6) 
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So we conclude that IS3I < C'\X\ 3+ Sl . Since the integrand of Js(A) is uniformly bounded 
on Re(z) = s for any s > 1, we conclude that 

|J 3 (A)| < C"\\\-? +Sl < C"\X\-i (4.7) 

This gives the needed estimate. Adding the contributions from | Ji(A)|, | ^2 (A) | , and | ^3 (A) | , 
we conclude that the contribution to |is(A)| from the square S is at most C"'\X\~i, and 
since — | < — \ we conclude that |i" 3 (A) | satisfies the bounds we need so long as b\ was 
chosen sufficiently small. 

Estimating |/2(A)|. 

We focus our attention on the main term /2(A), given by (4.46). We divide the 
domain of (4.46) into squares of diameter c| A | <5a , where c and 82 are small constants. For 
a given such square £, denote the corresponding term of /2(A) by /f(A)- We will show that 
if c and 62 are sufficiently small, then for any such S we have |^2 (A)| < C|A|~2~ e 7 where e 
is independent of c and S2, and C is independent of Im(z) for Re(z) = s > 1. Since there 
are at most c'|A| 2<52 squares, as long as we make sure 5 2 < § , this is enough to show that 
/2(A) itself satisfies the bounds needed for Theorem 4.1. This subdivision into squares is 
useful because it allows us to replace D(x,y) by a polynomial approximation of bounded 
degree which is therefore piecewise monotone in a direction in which we are integrating by 
parts, enabling us to use Van der Corput type arguments in such a direction. 

We now perform this polynomial replacement. For a given £ and positive integer 
N, let Dg (x, y) be the polynomial in x and y consisting of the sum of the terms of degree 
at most N of -D(x,y)'s Taylor expansion centered about the center of S. Thus on S we 
have 

\Dg(x,y) -D{x,y)\<C\\\- 5 * N (4.8) 

As a result, on S we have 

In particular, if is chosen large enough we can make the exponent N1 — 82N appearing in 
(4.9) less than -1. Consequently, for the purposes of the analysis of (A) we may replace 
-D(x, y — ip(x)) by Dg (x,y) in the (a(|A| 5l -D(x, y) — a(\X\ Nl D(x, y))) factor; the difference 
will contribute no more than C|A| _1 to I^W, and adding over all squares gives a result 
smaller than the bounds needed in Theorem 4.1. 

We can do something similar for the \D(x, y) \ Sz factor. Namely, suppose N is 
taken large enough that in (4.8) we have 

\D%(x,y)-D(x,y)\<C\\\- 2N i 

Then since \D(x,y)\ > ^\X\~ Nl when the integrand of (4.46) is nonzero, if |A| is large 
enough we may use the Taylor expansion of \x\ Sz about x = D(x, y) to obtain 

\\D(x,y)\ Sz - \D%(x,y)\ Sz \ < C\Im(z)\\X\- N ^ 5Re ^-^- 2Nl (4.9) 
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As a result, since Re(z) > 1, as long as Ni > 1, we have an estimate 

\\D{x,y)\ 5z -\D»{x,y)\ 5z \<C\Im{z)\\\\- 1 (4.10) 

2 

The e z is more than enough to take care of the \Im(z)\ factor in (4.10), and the exponent 
— 1 is less than — \. Consequently, we may replace \D(x, y)\ Sz by \Dg(x, y) \ Sz in the 
analysis of /ff(A); the difference added over all squares S contributes less than the bounds 
needed for Theorem 4.1. 

We have now shown that for the purposes of our future arguments, we may adjust 
our notation and assume is given by 

/f (A) = e z " J e - iXlf ( x >^- iX2X - iX3y \H{x,y - ^{x))\ z \D% {x,y)\ Sz 

x {a(\\\ s *D% (x, y)) - a(\\\ N *D% (x, y)))0*(x, y) dx dy (4.11) 

We divide the domain of integration of (4.11) into the intersections of S with 
the D'l , E'/j, F"j, and G"a and denote the corresponding term of /2(A) by (A), I^ %d (A), 

I 2 lJ (A), and I 2 IJ (A). (Recall D" = {(x,y) : (x,y — ip(x)) E Di} with analogous definitions 
for the other regions). We suppress the S since the bounds we will prove, given in the 
statement of Theorem 4.1, are independent of S. We will only consider those regions for 
which x > as the x < ones are entirely analogous. We now focus our attention on the 
analysis of the (A). 

Bounds for |lf *(A)|. 

Recalling that \H(x,y)\ = F*(x,y)^~~^ on a D i: if we change coordinates from (x,y) to 
(x, y + ip(x)) in (4.11) we obtain 



/f*(A) = e z " [ e -iXiF(x,y)-iX 2 x-i\ 3 (y+i,(x)) p* (x,y)G-fr )e \D% \x,y + V>( 
J SnDi 



x))\ 



Sz 



{a(\\\^D% (x, y + ^{x))) - a{\\\ N ^D% (x, y + y) dx dy (4.12) 

Here (f)**(x, y) denotes a new cutoff function on a neighborhood of the origin, and F(x, y) = 
f(x,y + ift(x)) is in generic adapted coordinates not satisfying the exceptional situations 
of Theorem 1.1. We slightly abuse notation in (4.12) in that S now denotes the square 
in the new coordinates. We now decompose the domain of (4.12) into dyadic rectangles. 
We only consider those rectangles in the upper right quadrant as the other quadrants are 
done the same way. For a given dyadic rectangle Jm = [2~ fc_1 , 2~ fc ] x [2~' _1 , 2~ z ], we use 
the shorthand by Iki to denote the corresponding term of /^(A), given by 



ikl 



f e -iX 1 F(x,v)-iX a x-iX 3 (v+^(,x)) F *^ y ^l-^)z\ D N ^ y + ^(a-))]^ 

JSnDitUM 
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(a(|A|*£# (x, y + iP(x))) - a(\\\ N ^D% (x, y + ^(x))))^(x, y) dx dy (4.13) 

We will analyze (4.13) by imitating the proof of Van der Corput's lemma in the 
y direction. Our objective is to show that (4.13) is bounded by C(l + |A|) _ 2 _e as in the 
statement of Theorem 4.1. The second y derivative of the phase function in (4.13) is given 
by \\d yy F(x,y), and by (2.14), if the vertex of N(F) corresponding to Di is written as 
(a,i,bi), then on Di we have \d yy F(x, y) \ > c\x\ ai \y\ bi ~ 2 . Since x ~ 2~ k and y ~ 2~ l on Jki 
we can write this as 

\9 vy F(x,y)\ > c'j^y 2 (2- k r(2- l t (4.14) 

As in the proof of the Van der Corput theorem for functions with nonvanishing second 
derivative, we will split the integral (4.13) into two parts. The first is the part where 

I |I - a i k -( b j- 2 ) 1 

\XidyF(x,y)+Xs\ < | A| a 2 2 ; and the second is the part where \Xid y F(x, y)+ A3 1 > 

1 —a i k — (b i —2)l 

|Ap2 2 . Call the resulting integrals K\ and K 2 , so that Ki + K 2 = Iki- We will 
bound K\ by taking absolute values and integrating, and K 2 by performing an integration 
by parts. 

We start with K\. The integrand of (4.13) is bounded in absolute value by a con- 
stant times F*(x,y) R < z ^^-^\D§(x,y + tfj(x))\ 5R < z \ By (2.20), F*(x,y) < C\x ai y b * \ < 
C2 - kai -lb^ and on the domain of (4 13 ) we have \Df*(x,y + i){x))\ 5z < C'\X\- 6SlRe ^ 

Hence if s denotes Re{z), the integrand of (4.13) is at most 

Q//\^\-55 1 S2(-ka i -lb i )s(±--±r) (4-15) 

Since by (4.14) the absolute value of the y-derivative of Xid y F(x,y) + A3 is at least 

c / |A|2 _fcai_ ^ bi_2 ) we have 

1 -a,-fe-(b,-2)i 1 fea,-+((b,- -2) 

\{y: \X 1 d y F(x,y) + X 3 \ < |A|*2 * }| < |A|"32 * (4.16) 

Thus bounding the y integral of K\ by (4.15) times the measure (4.16) and then integrating 
the result in x, we obtain 

\Ki\ < c"'\X\~^~ SSlS 2 ( '~ kai ~ lbi ^ £ ^ 1 ~~^' ) 2~ k ~ l (4-17) 

We now turn to K 2 and show that K 2 also satisfies the upper bounds of (4.17). We 
integrate the integrand in (4.13) by parts in y, integrating the factor (Xid y F(x,y) + 
X \ e -i\iF(x,y)-i\ 2 x-i\ 3 ( y -^(x)) d differentiating , » m times the rest of the in- 

i " to XidyF(x,y)+X 3 

tegrand. We get several terms depending on where the derivative lands. If the derivative 
lands on <f>*(x, y), the absolute value of the integrand in the resulting term is bounded by 

CF*(x,y) s ^-^ ) \D^(x,y + ij(x))\ 6s \X\-^2 aik+ ^~ 2)l (4.18) 

Bounding F*(x, y) < C2~ kai ~ lbi and \Dg(x,y + ij)(x))\ < C|A|~ 51 as in the analysis of Ki , 
we get that (4.18) is bounded by 

(j^i-kai-ib^i^k- ^)2 — z I A| — i — (4.19) 
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Integrating (4.19) over S C Iki multiplies this by at most C2 k so the resulting term is 
at most 

( jl2(-ka i -lb i )(^---^)2-k-2l^-\-SS 1 s (4.20) 

Note this is better than the estimate (4.17). We next consider the case where the y- 
derivative lands on the x » J — nr- factor, turning it into — rx 7r^gf— ^rrr ^2 ■ We take 

X 1 d y F(x,y)+X 3 ' o {X^dyFix^+X^) 2 

absolute values and integrate in the y variable as in the proof of the Van der Corput lemma, 
bounding the other factors as was done for (4.19). Since by (4.14) the function d yy F{x, y) 
is never zero on the domain of integration, we have at most finitely many intervals of 
integration on each of which I X a d £? F ^ ,y } n 2 I integrates back into ± x » F / — rpr • Hence 

& l {X 1 dyF(x,y)+X a ) 2 I o AidyF(x,y)+A 3 

the resulting term, as well as the endpoint terms, will be bounded by (4.20), except divided 
by the y-width 2~ l . We conclude that this term is bounded by (4.17), namely 

Qll2(-ka i -lb i ){^--^)2-k-l^-\-SS 1 s (4-21) 

If the y-derivative lands on either the \Dg (x,y + ip(x))\ 5z or (a(\\\ Sl Dg (x,y + tp(x))) — 
a(\X\ Nl Dg (x, y + ip(x)))) factors one estimates the resulting term in very much the same 
way; the fact that Dg (x, y) is a polynomial and a is monotone ensures that the Van der 
Corput lemma proof still applies and we will have boundedly many intervals of integra- 
tion on which the appropriate derivative is nonvanishing. Similarly, since F*(x,y) 2 is a 
polynomial, one can deal with the term where the derivative lands on the damping factor 
F*(x, y) z ^-d^) in a similar fashion. It should be pointed out that in taking these deriva- 

2 

tives we do incur a factor of C\Im(z)\, but this is more than compensated for by the e z 
factor. Hence we once again get the upper bound (4.21). Adding all terms together, we 
see that \K 2 \ and therefore \Iki\ is bounded by (4.17), the estimate we need. 

We rewrite (4.17) in an especially useful form. Recall that by (2.20), on S we 
have C2(- kai ~ lb ^ < F*{x,y) < C'2^- kai ~ lbi \ So we have just shown that 

\hl\<C f \\\--*-^F*(x,y)^ 1 -^ dxdy (4.22) 



We now break into cases d* < 2, and d* > 2, starting with the latter. Adding (4.22) over 
all rectangles, we obtain that {I^Wl is & t most 



C\X\~^-^ I F*(x,y)V-f dxdy (4.23) 

J [OA] x [0,1] 

Note that if s > 1, then — 4 > — -j, and thus since F*(x 1 y) t is integrable over 
[0,1] x [0,1] for all t > —4, the integral in (4.23) is finite and we obtain that (-^(A)! 

1 SSi s 

is bounded by C|A|~2 — . Since the exponent here is less than — this gives what is 
needed for Theorem 4.1. 

Moving on to the d* =2 case, (4.22) becomes 

\hi\<C I \\\-*-^F*{x,y)-*dxdy (4.24) 
J Jki 
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Since the damping factor is just \Dg (x, y + ip(x))\ Sz when d* = 2, from (4.13) we get 



.2 

'fci = e 



/" e -ttiF(**)-iA 2 *-iA 3 (v+V>(*))|£)£ y + ^^^|< 



'SnDinJfei 

x (a(|A| 5l L>5 (x, y + </>(*))) - a(| (x, y + y) dy (4.25) 

Note that due to the cutoff and the presence of the \D$ (x, y + ip(x))\ Sz in the integrand of 
(4.25), this integrand is at most |A| _5<5lS . So just by taking absolute values and integrating 
we get 

\hi\ < C\X\- 5SlS 2- k - 1 (4.26a) 



<C|A|^ il 2- fc - z (4.266) 



Combining this with (4.24), we get 



-SS-t 



|Jfcz| <C|A|— s— / min(l, \\F*(x,y)\~*) dx dy (4.27) 

J[2-i°- 1 ,2-i°]x[2-i- 1 ,2-i] 

Adding this up over all j and k, we obtain that (-^(A)! is at most 

CIM^^ 11 [ wm(l,\\F*(x,y)\-*)dxdy (4.28) 

J[0,1] x [0,1] 

Since (d, d) £ N(F), (d, d) is a convex combination of vertices of N(F). So since F*(x, y) is 
comparable to the sum of |x a y & | over vertices (a, b) of N(F), we have F*(x, y) > C\x d y d \. 
Since we are assuming d < 2 here, we conclude that F*(x,y) > Cx 2 y 2 and as a result 
(4.28) is bounded by 

C\\\ z ^r 1 f min(l,— % — \dxdy (4.29) 
i[o,i]x[o,i] V \>\*xy) 

A direct calculation reveals that the right hand side is bounded above by C"|A| - 5(ln |A|) 2 
(The integral over [| A| — 2 , 1] x [| A| — 2 , 1] is bounded by a constant times the integral of 
— t — over this region, while the integral over the remaining region is bounded by its 

\\\lxy 

area). As a result, (4.29) is bounded by 

C'lAI-i-^lnlAI) 2 (4.30) 
Since the exponent here is less than — | we have proved the desired bounds for the *(A)|. 

Bounds for \I^ (A)|. 

E ■ 

Note that I 2 11 (A) is given by 

/f w (A) =e z2 [ e ^\if(x,y)^\2x^x a y\ F *( x ^ y _ ^( X ))\ z (h-^)\D^ \x,y)\ Sz 
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{a(\X\ s ^D§ (x, y)) - a{\\\ N ^D% (x, y)))cj>*{x, y) dx dy (4.31) 

As we did with 1%* (A), we break the domain of integral (4.31) into rectangles Jki = 
[2~ fc_1 ,2~ fc ] x [2~ l ~ 1 ,2~ 1 ]. Denote the corresponding term of (4.31) by I k u so that 

E w 4i = / 2 Bii (A). 

Note that by (2.21) we have 

F*(x,y) > Cx aMh (4.32) 

As before (a^, bi) denotes the upper vertex of e^. Recall that Eij lies between y = (r — rj)x Mi 
and y = (r + rj)x Mi for some r and rj such that F ei (1, r) 7^ 0, and that by definition of E^, 
ip(x) has a zero of order at least Mi at x = 0. Consequently, \y — ip(x)\ < Cx Mi on E^. 
Thus by (2.11), F*(x,y- if>(x)) < Cx ai+biMi on E %r Combining with (4.32) we get 

F*(x, y - ij>(x)) < CF*(x, y) (4.33a) 

By Lemma 2.4, on the domain of (4.31) we have 

\d xx f(x,y)\>c\x ai (x Mi ) bi (4.336) 
x z 

Equation (4.33a) shows that the damping function F*(x,y — ijj(x)) satisfies the same upper 
bounds that the damping function F*(x,y) did in the Di case. Equation (4.336) shows 
the same thing for the phase (cf (4.14)), reversing the roles of the x and y derivatives. 
Furthermore, the functions that need to be piecewise monotone in x with boundedly many 
pieces in order to perform the Van der Corput argument do satisfy this; Dg (x, y) is a 
polynomial and the second x derivative of —i\if(x,y) — i\2X — iXsy is nonvanishing by 
(4.336). Hence by repeating the Di argument, reversing the roles of the x and y variables, 
we get that Iki is bounded by (4.17). Adding this up like before gives that as in (4.30), 

|/>f IJ (A)| is bounded by C'l A| — i ^(ln|A|) 2 , the estimate we need. 



Bounds for |/ 2 iJ (A)|. 



Mi 



< v\x 



Mi 



}, where 



Thus G(x, y) is a 



Recall the set Fij is of the form {(x, y) : < x < rj, \y — rx 
F e .(l,y) has a zero of order 1 at y = r. Define G(x,y) = F(x,y + rx Mi 
function on the set Hij = {(x,y) : < x < rj, \y\ < u\x\ Mi } such that G ei (l,y) has a zero 
of order 1 at y = 0. Thus N(d y G) has an edge with equation x + Miy = cii + Mibi — Mi 
that intersects the x axis. Conseqently, N(-j^j^) has an edge with equation x + Miy = 
cti + Mibi — Mi — 1 intersecting the x axis. Hence assuming rj was chosen sufficiently small, 
by Lemma 2.3 we may conclude that on we have 



d 2 G 



dxdy 



> C X a i+ M i b i~ M i 



We rewrite this as 



d 2 G 
dxdy 



(x,y) >C-^ m - ) x^(x M ^ 



(4.34) 
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Letting ip(x) = ip(x) + rx Mi , we do a change of variables from y to y + ip(x) and write 



/f'(A) as 




x|Z^(a;,y + ^(20)| 52 (a(|A| 5l Z^ 



As with the D i: the £ under the integral symbol now denotes the square in the new 
coordinates. By (2.21), on we have 



We now break the domain of integration of (4.35) up into rectangles Jk of the form 
[2— fc — x , 2 _fc ] x \-v2- hM \v2- kM % and let I' k the the portion of (4.35) coming from J k . 
Equation (4.36) shows that the damping function F*(x,y + rx Mi ) in (4.35) satisfies the 
same upper bounds the damping function did on the IkM k rectangle for the the Di. (The 
x ~ 2~ k rectangle of the "lower edge" of Di). As for the phase, instead of having a lower 
bound on a second y derivative as in (4.14), we have the substitute (4.34). We still may 
argue as for the IkM k rectangle in the Di case, but with one difference. In the analysis 
of the term called K\ below (4.14), instead of bounding the measure of a sublevel set of 
\Xid y G(x,y) + A3 1 in the y-variable and integrating with respect to x, one bounds the 
measure of the same sublevel set in the x variable using (4.34) and then integrates the 
result with respect to y. 

Furthermore, all relevant factors are piecewise monotone with boundedly many 
pieces. The function Dg (x, y+ip(x)) is a polynomial in y of bounded degree, as is F* (x, y + 
rx Mi ) 2 , while since G(x,y) is just a y shift of F(x,y) by ip(x), if (0, a) denotes the upper 
vertex of N(F) then d y G(x, y) has nonvanishing (a — l)th y derivative. 

Hence after making the above adjustment to the IkMik argument of the Di case, 
for a given k we get the bounds (4.17) for I' k . (The arguments there did not require / to 
be an integer). Adding over all k, as for the \I 2 l (A)| we get that |/ 2 lJ (A)| is bounded by 

I SS ^ s _ 

C |A| _ 2 — (In I A I ) , the needed estimate. 
Bounds for |/ 2 GlJ (A)|. 

G 

For the I 2 13 (A), we separate the hij = 2 and kij > 2 cases as the damping factors 
are different in these two situations. First, we suppose = 2. Then Ip J (A) is given by 



(4.35) 



F*(x,y + rx M *) < Cx a ^x M ^ 



(4.36) 




6z 



a(\X\^D^ (x, y + ^(x))) - a(\\\ N ^D% (x, y + y) dx dy 



(4.37) 
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Observing that \y\ < Cx Mi on Gij, we divide the domain of (4.37) into rectangles J k of the 
form [2- fe - 1 ,2- fc ] x [C 2~ kM ^ C x 2- kM % and let I' k be the corresponding piece of if"' (A), 
so that£ fc i£ = i 2 G -(A). 

Note that the integrand in (4.37) is the same as that of (4.13) for the Di case. In 
particular, the damping function is the same as in the Di case. Also, by (2.19) on Gij we 
have the following analogue of (4.14): 

\d vv F{x,y)\>C T ±^a**{x M *) b * (4.38) 

\ x i ) 

As a result, all estimates used in the Di case for the I k i rectangle, setting / = kMi (the 
lower edge of Di) hold for the term I' k . Thus \I' k \ is bounded by C\IkMik\, an d adding over 

all k we recover C|A|~^ ^(ln|A|) 2 as an upper bound for |if lJ (A)|. This completes the 
proof for the kij = 2 case. 



We may now assume kij > 2, focusing our attention for now on the case when N(F) has 

G 

multiple vertices. Here, i 2 lJ (A) is given by 

\Dg (x, y + ^(x))| 52 (a(|A| 51 ^ + " adAl" 1 ^ + ^rfy 

(4.39) 

We divide the domain of (4.39) into rectangles J k as in the above kij = 2 case, and again 

G ■ ■ 

let I' k be the corresponding piece of t T J (A). Observe that by (2.23), there is some Cq such 
that the magnitude of the bracketed expression in (4.39) (which is the same as the H(x, y) 
in (2.23)) is bounded by Co(x ai+Mibi ) 2~d . Thus we may write I' k = £^ -Pfcz, where Phi 
is the portion of the integral over J k where \H{x,y)\ is between 2~ l+1 C (x ai+Mibi ) 2~d 
and 2- l C (x ai+Mibi )2-*. We will now bound each P kl . To this end, note that on the 
domain of Pki, by the definition of H(x, y) and the P k i we have 

Solving for |fj^(a;,2/)|, we get 

CJ-Vj^x**™* < |0(x,y)| < Cl2- 2 '^x-*^^ (4.41) 

One now bounds -Pfc; by integrating by parts in y in the portion of (4.39) corresponding 
to Pki- One proceeds exactly as for the IkMik term of the Di (the x ~ 2~ k rectangle of 
the "lower edge" of Di), except instead of using \^pr{x,y)\ > C j^Mjy x ai+Mibi from (4.14) 
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one uses (4.41). This gives us an additional factor of C2 l in the resulting bounds for the 
integral. This however is compensated by the damping factor, which by the definition of 
Pki is bounded by C2~ lRe ( z > times the damping factor used for the IkMik term in the Di 
case. Thus the overall integral is bounded by C2" 1 ~ Re ( z ^ times what is obtained for the 
IkMik term in the Di case. We do not have to worry about whether each factor in (4.37) 
is boundedly piecewise monotone in y in our integrations by parts; the only new element 
in this regard is ^p-(x,y), whose (kij — 2)th y derivative is nonvanishing. 

Since Re(z) > 1, we conclude Pki is bounded by a constant times the estimate 
obtained for the IkMik term in the Di situation, and adding this over all k gives 

|/ 2 GlJ (A)| < \ p ki\ < ^lAr^-^QnlAI) 2 (4.42) 

kl 

This is the estimate we seek. The above argument was for when N(F) has multiple vertices, 
but when N(F) just has one vertex the following simplified version of this argument works. 
In the one vertex situation, \H(x, y)\ = \ ^p-(x,y)\. This time we let Pi be the portion of 

the integral defining i^* IJ (A) over the set where \H(x,y)\ is between Co2~ l ~ 1 and Co2~\ 
where Cq denotes the maximum value of \H(x, y)\. Like above, for Pi the decreased second 
y derivative of the phase gives an additional factor of C2 l which is more than compensated 
by the additional C2~ lRe ^ factor coming from the damping function. Adding over all /, 
we recover (4.42). This completes the proof of the bounds for the \I 2 lJ (A)|, which in turn 
completes the proof of Theorem 4.1. 

The proof of Theorem 1.1. 

We may now finish the proof of Theorem 1.1 in short order. First suppose d(F) > 
2. For any r\ > 0, Theorem 3.1 says that on the line Re(z) = — d ^_ 2 + "Hi M z is bounded 
on L°° with uniform constant, while Theorem 4.1 in conjunction with Theorem 1.2 says 
that on Re(z) = 1 + rj, M z is bounded on L 2 with uniform constant. Using interpolation 
for maximal operators (see Ch. 11 of [St2]), we have that Mo is bounded on L d ( F ) +v 
where r( — > as i] — > 0. Thus we conclude Mq is bounded on L p for all p > d(F). Since 
d(F) = h(qo) = max(2, h(qo)), this gives Theorem 1.1 for d(F) > 2. 

On the other hand, if d(F) = 2, Theorem 3.1 says that on any vertical line 
Re(z) = s, M z is bounded on L°° with uniform constant, and Theorem 4.1 still applies 
on a line Re(z) = 1 + rj. Thus interpolation now gives the result obtained by letting d(F) 
approach 2 in the previous paragraph, namely that M is bounded on L p for p > 2 = 
max(/i((/o), 2). This completes the proof of Theorem 1.1. 

5. References. 

[B] J. Bourgain, Averages in the plane over convex curves and maximal operators, J. Anal. 
Math. 47 (1986), 69-85. 



28 



[CoMa] M. Cowling, G. Mauceri, Inequalities for some maximal functions. II, Trans. 
Amer. Math. Soc. 298 (1986), no. 1, 341-365. 

[Gl] M. Greenblatt, Maximal averages over hyper surf aces and the Newton polyhedron, 
submitted. 

[Gr] A. Greenleaf, Principal curvature and harmonic analysis, Indiana Univ. Math. J. 30 
(1981), no. 4, 519-537. 

[IkKeMul] I. Ikromov, M. Kempe, and D. Miiller, Damped oscillatory integrals and bound- 
edness of maximal operators associated to mixed homogeneous hypersurfaces (English sum- 
mary) Duke Math. J. 126 (2005), no. 3, 471-490. 

[IkKeMu2] I. Ikromov, M. Kempe, and D. Miiller, Estimates for maximal functions asso- 
ciated to hypersurfaces in R 3 and related problems of harmonic analysis, Acta Math. 204 
(2010), no. 2, 151-271. 

[IkMu] I. Ikromov, D. Miiller, On adapted coordinate systems, to appear, Trans. AMS. 

[IoSal] A. Iosevich, E. Sawyer, Oscillatory integrals and maximal averages over homoge- 
neous surfaces, Duke Math. J. 82 no. 1 (1996), 103-141. 

[IoSa2] A. Iosevich, E. Sawyer, Maximal averages over surfaces, Adv. Math. 132 (1997), 
no. 1, 46-119. 

[NaSeWa] A. Nagel, A. Seeger, and S. Wainger, Averages over convex hypersurfaces, Amer. 
J. Math. 115 (1993), no. 4, 903-927. 

[PSt] D. H. Phong, E. M. Stein, The Newton polyhedron and oscillatory integral operators, 
Acta Math. 179 (1997), 107-152. 

[So] C. Sogge, Maximal operators associated to hypersurfaces with one nonvanishing prin- 
cipal curvature (English summary) in Fourier analysis and partial differential equations 
(Miraflores de la Sierra, 1992), 317-323, Stud. Adv. Math., CRC, Boca Raton, FL, 1995. 

[SoSt] C. Sogge and E. Stein, Averages of functions over hypersurfaces in R n , Invent. 
Math. 82 (1985), no. 3, 543-556. 

[Stl] E. Stein, Maximal functions. I. Spherical means. Proc. Nat. Acad. Sci. U.S.A. 73 
(1976), no. 7, 2174-2175. 

[St2] E. Stein, Harmonic analysis; real-variable methods, orthogonality, and oscillatory 
integrals, Princeton Mathematics Series Vol. 43, Princeton University Press, Princeton, 
NJ, 1993. 

[V] A. N. Varchenko, Newton polyhedra and estimates of oscillatory integrals, Functional 
Anal. Appl. 18 (1976), no. 3, 175-196. 

Department of Mathematics, Statistics, and Computer Science 
University of Illinois at Chicago 
322 Science and Engineering Offices 
851 S. Morgan Street 
Chicago, IL 60607-7045 



29 



